feat: florencev2 fine-tuning meta tool #190

Dayof · 2024-08-06T23:17:45Z

Motivation

This meta tool enables the possibility to fine-tune the florencev2 model using one of the tasks: CAPTION, CAPTION_TO_PHRASE_GROUNDING and OBJECT_DETECTION.
florencev2_fine_tuning calls LandingAI public api /v1/agent/jobs/fine-tuning which is responsible to launch a fine-tuning job in LandingAI environment.
It will be possible to check the job status using the fine-tune job id, and also run inference with it.

Local test

OBJECT_DETECTION

img_path = sys.argv[1]
agent_gpt4t = VisionAgent(verbosity=2)
prompt = """fine-tune florencev2 with the following bounding boxes: 
[{'image': 'cereal_3.jpg', 'labels': ['screw'], 'bboxes': [(713, 1363, 909, 1567)]}]"""
agent_gpt4t(prompt, img_path)

Code generated:

from typing import *
from vision_agent.utils.execute import CodeInterpreter
from vision_agent.tools.meta_tools import generate_vision_code, edit_vision_code, open_file, create_file, scroll_up, scroll_down, edit_file, get_tool_descriptions, florencev2_fine_tuning

florencev2_fine_tuning([{'image_path': '/home/dayoff/Downloads/cereal_shankar/cereal/train/cereal_3.jpg', 'labels': ['screw'], 'bboxes': [[713, 1363, 909, 1567]]}], 'OBJECT_DETECTION')

CAPTION

img_path = sys.argv[1]
agent_gpt4t = VisionAgent(verbosity=2)
prompt = """fine-tune florencev2 to be able to caption images, use the following bounding boxes to fine-tune: 
[{'image': 'cereal_3.jpg', 'labels': ['screw'], 'bboxes': [(713, 1363, 909, 1567)]}]"""
agent_gpt4t(prompt, img_path)

Code generated:

from typing import *
from vision_agent.utils.execute import CodeInterpreter
from vision_agent.tools.meta_tools import generate_vision_code, edit_vision_code, open_file, create_file, scroll_up, scroll_down, edit_file, get_tool_descriptions, florencev2_fine_tuning

florencev2_fine_tuning([{'image_path': '/home/dayoff/Downloads/cereal_shankar/cereal/train/cereal_3.jpg', 'labels': ['screw'], 'bboxes': [(713, 1363, 909, 1567)]}], 'CAPTION')

CAPTION_TO_PHRASE_GROUNDING

img_path = sys.argv[1]
agent_gpt4t = VisionAgent(verbosity=2)
prompt = """fine-tune florencev2 to be able to turn caption to phrase grounding, use the following bounding boxes to fine-tune: 
[{'image': 'cereal_3.jpg', 'labels': ['screw'], 'bboxes': [(713, 1363, 909, 1567)]}]"""
agent_gpt4t(prompt, img_path)

Code generated:

from typing import *
from vision_agent.utils.execute import CodeInterpreter
from vision_agent.tools.meta_tools import generate_vision_code, edit_vision_code, open_file, create_file, scroll_up, scroll_down, edit_file, get_tool_descriptions, florencev2_fine_tuning

florencev2_fine_tuning([{'image_path': '/home/dayoff/Downloads/cereal_shankar/cereal/train/cereal_3.jpg', 'labels': ['screw'], 'bboxes': [(713, 1363, 909, 1567)]}], 'CAPTION_TO_PHRASE_GROUNDING')

Extras

Add pre-commit

…lorence-fine-tuning

dillonalaird

looks good!

Dayof added 10 commits August 6, 2024 19:57

add nptyping

66a1478

add florencev2 fine tuning

687203f

Merge branch 'main' of github.com:landing-ai/vision-agent into feat/f…

c63c752

…lorence-fine-tuning

add function name back

57845ee

fix linting

11d5c58

add pre-commit

630eff3

mypy

5eb26ef

resolve mypy

e201d43

add task customization

fa2452a

tools to meta tools

c9ab90a

Dayof changed the title ~~feat: florencev2 fine-tuning~~ feat: florencev2 fine-tuning meta tool Aug 7, 2024

fix imports

f391b44

dillonalaird approved these changes Aug 7, 2024

View reviewed changes

remove nptyping

cb31101

Dayof merged commit ae97907 into main Aug 7, 2024
8 checks passed

Dayof deleted the feat/florence-fine-tuning branch August 7, 2024 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: florencev2 fine-tuning meta tool #190

feat: florencev2 fine-tuning meta tool #190

Dayof commented Aug 6, 2024 •

edited

Loading

dillonalaird left a comment

feat: florencev2 fine-tuning meta tool #190

feat: florencev2 fine-tuning meta tool #190

Conversation

Dayof commented Aug 6, 2024 • edited Loading

Motivation

Local test

OBJECT_DETECTION

CAPTION

CAPTION_TO_PHRASE_GROUNDING

Extras

dillonalaird left a comment

Choose a reason for hiding this comment

Dayof commented Aug 6, 2024 •

edited

Loading